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1. INTRODUCTION 

At present, when we talk about artificial intelligence, we do not just mean a set of theories; on the 
contrary, it has entered many practical applications [1]-[6]. It has become an essential part of many 
industries. On the other hand, by looking at artificial intelligence algorithms that depend on deep learning, we 
see the potential of these algorithms to solve complex problems because they try to simulate the human mind. 
Also, with advanced deep learning techniques, tampering with media is possible, provided large amounts of 
data are obtained for training. The main concern here is using these artificial intelligence (AI) tools for 
malicious purposes, such as creating vulgar videos or creating false advertising campaigns. In recent years, 
fake multimedia has become a problem that must be focused on and solved, especially after the emergence of 
so-called deepfakes. Deepfakes are images and video clips that were tampered with using artificial 
intelligence [7], such as generative adversarial networks (GANs) [8]-[11] or auto encoders (AEs) [12], [13]. 

The term deepfake came from a Reddit user who converted celebrity faces into porn videos by 
developing a machine learning algorithm [14]. Technology companies and those responsible for social media 
platforms have made a wide effort to uncover so-called deepfakes attacks. For example, Microsoft designed 
software to detect deepfakes. This program checks the images and videos and then generates a percentage 
indicating the extent to which the input material is artificially created [15]. In 2019, Facebook announced the 
deepfake detection challenge (DFDC) in cooperation with major technology companies. This initiative 
stimulated the creation of new methods that detect deepfakes. They created a giant database of more than 
100,000 videos [16]. Facebook also banned all the videos created using artificial intelligence in a way that 
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could not be detected by the users of this platform [17]. The defense advanced research projects agency 
(DARPA) and the air force research laboratory (AFRL) have funded the media forensics research project 
(MediFor), whose purpose is to find solutions and technologies to ensure the integrity of digital images and 
videos [18]. Three main components are considered in the MediFor program, namely: i) digital integrity, 
ii) physical integrity, and iii) semantic integrity [19]. 

In many cases, video evidence is assumed in criminal investigations to be reliable. But with the 
development of technology, where it became possible to tamper with videos, and audio recordings, it is likely 
soon to subject such evidence to a test to ensure its authenticity [20]. Therefore, finding ways to help detect 
deepfakes is of great importance to preserve the legitimacy of digital evidence. 

In this paper, we investigate the detection of these tampered video contents, especially deepfake 
videos. Our method uses four focus measures (Laplacian and gradient) to evaluate the amount of blur in the 
frames belonging to the fake video and compare it to the real video. We focus on assessing the amount of 
blurring because most deepfake applications use blurring to hide the border of the cropped face. Our 
contribution to this work can be summarized as follows: i) we use a simple way to detect the blurring residual 
in fake videos using the focus measures; ii) a small dataset is used to implement our work without using a 
large dataset for training; and iii) our method uses simple measures to detect the forgery instead of using deep 
learning methods, which do not easily detect blurring residuals. 


2. RELATED WORK 

This section will briefly mention some of the works related to deepfake detection and focus on 
methods that explore artifacts in fake video frames. Concerning the detection of deepfakes, image 
manipulation is not a new problem that has been highlighted since the emergence of deepfakes; for example, 
image modification with Photoshop tools is being used up to the present day. Also, multimedia forensics 
science is still dealing with this problem [17]. Deepfake algorithms leave traces on media that are difficult to 
detect with the naked eye, so more research is devoted to helping detect deepfakes. Figure 1 shows the 
number of research papers that have provided solutions in the field of deepfake detection. 
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Figure 1. The number of research papers that are implemented in terms of deepfake detection [21] 


Methods for detecting fakes in a video can be classified into two categories [7]: techniques that 
depend on temporal features and those that rely on visible artifacts within frames. We concentrate on the 
second type, which breaks down the video into frames and then works on each frame by searching for the 
visual artifacts to obtain distinctive properties or features and then feeding them to a classifier. For example, 
Koopman et al. [20] used photo response non-uniformity (PRNU) to detect deepfakes manipulation. The 
analysis of PRNU is an attractive method because it finds that tampering with the facial area region affects 
the local value of the PRNU pattern in video frames. This analysis is widely used in the field of video 
forensics [22]. Compression artifacts are also exploited in this area. It was found that when the manipulated 
JPEG-image is compressed a second time, the whole image has the effects of double compression, except for 
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the fake face area [23]. In addition to the defects generated by recompression, the process of deepfakes also 
generates a set of precious traces, which need post-processing operations to hide those traces or defects. 
Bahrami et al. [24] divided the blurred image into blocks to extract the characteristics of the blur type. 
Finally, these local blur characteristics are combined to classify the image blocks into motion or out of focus. 
These artifacts, especially the artifacts resulting from blurring the border of the fake face, were used in this 
experiment for detecting fakes in videos. 


3. FOCUS MEASURES 

Focus measures (FM) are useful for the measurement of the amount of blur contained in an image. 
Pertuz et al. [25] divided these measures into six categories: gradient-based (GB) and Laplacian-based (LB) 
operators, which are based on the first derivative (gradient) and second derivative to measure the degree of 
focus and the number of edges found in the input image, respectively. Wavelet-based (WB) and 
DCT-based (DCT-B) operators use the fact that the coefficients of the discrete wavelet transform and discrete 
cosine transform can be used to measure the focus level. Statistics-based (SB) operators use image statistics 
to find focus levels. Finally, miscellaneous operators, which do not belong to any of the prior groups, can 
measure the amount of blur founded in the image. 

Our work was based on the focus measures operators mentioned in [25]. The selection fell on the 
Laplacian family because of its good performance in common imaging conditions. Also, we use the 
gradient-based operator because it showed promising results from among the 11 high-performance scales 
tested in [25]. We did not refer to other standards, such as deep cryogenic treatment (DCT), due to their 
dependence on their applications. All focus measures considered in this paper are presented in Table 1, along 
with their exact abbreviation from [25]. 


Table 1. Focus measures used and their abbreviation 


Focus Measure Abbr. 
Energy of Laplacian LAPI 
Modified Laplacian LAP2 
Diagonal Laplacian LAP3 
Tenengrad Variance GRA7 


4. METHOD 
4.1. Dataset 

The dataset used in our experiment is the UADFV dataset [26]. It is simple and has only 49 videos 
with two classes, fake and real. Figure 2 shows a sample of this dataset. We use ten videos from this dataset 
to implement our experiment. 


(b) 


Figure 2. Sample of the dataset used (a) is an example of deepfakes with low quality (notable edges), while 
(b) shows an example of deepfakes with high quality (polished edges) 


4.2. Focus measure operators’ analysis 

Each video in the dataset is converted into a series of frames. This step is followed by cropping each 
frame into eight parts to apply the focus measure mentioned in Table 1 on each part. Figure 3 shows how to 
divide each frame into eight parts. Before applying the focus measure operators to the split image, the 
grayscale channel of the image is taken into consideration. After applying each measure on each part, the 
average value of the eight measure values is calculated for each frame image. This process is repeated for all 
video frames, whether real or fake, to calculate the final value of the focus measure operators. 
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Compute the average of all eight focus measures (FM) values 

Figure 3. Overall steps for analyzing the quality of focus measures. The video frames are cropped into eight 
parts where an average value of the focus measure (FM) is calculated for each frame 
5. RESULTS 


we note that the values of the Laplacian-based operators were lower or higher than their values in the real 
blurring is applied to the image. 


The focus measure for each video was calculated. Figures 4(a)-(d) show the results energy of 
video, depending on the number of edges present in the fake video. We know that the edges decrease as more 


Laplacian, modified Laplacian, Laplacian variance, and Tenengrad variance, respectively. From these results, 
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Figure 4. The results of applying focus measure operators: (a) energy of Laplacian, (b) modified Laplacian, 
(c) Laplacian variance, and (d) Tenengrad variance, per fake videos against real one 


Fast discrimination of fake video manipulation (Wildan Jameel Hadi) 


2586 O ISSN: 2088-8708 


By looking at the results achieved by the gradient-based operator, we find that the value of this 
measure is less than its value in a real video in all cases, whether the fake video is of high quality or low. The 
results indicate that we can use the gradient-based measure to detect deepfakes. This explains the algorithm’ s 
performance in terms of deepfake detection. As for time, the response time of a single video is measured in 
seconds, so if we compare the time taken for all videos, it is negligible in relation to the time taken by deep 
learning-based methods. 


6. CONCLUSION 

In this analysis, a methodology for deepfake detection is suggested. Our method exploits the visible 
artifacts (blur inconsistencies) generated via post-processing steps applied on fake videos. Compared to other 
methods that use deep learning, our method does not require high computational power and extensive data 
for training. Still, it is based only on the comparison of the performance of various focus measure operators. 
The selection of operators has been chosen after a comprehensive review of the results of much recent 
literature. Four focus measure operators are selected to evaluate the degree of focus between fake videos and 
real ones. Various mathematical principles were used in the analysis and testing of the focus measure 
operators. Out of 36 operators, the best four operators were selected to analyze and compare their 
performance. The selection fell on the Laplacian family and the gradient-based operator. Experiments have 
been implemented on a test set of both fake and real videos. Experiments showed that the values of the 
Laplacian-based operators were lower or higher than their values in the real video, depending on the number 
of edges present in the fake video. This means the Laplacian family cannot be used in deepfake detection. On 
the other hand, the gradient-based measure (GRA7) can distinguish fake videos from real ones. The dataset 
used in our experiment is overly small to subedit guidelines for likelihood ratios, as is desired in forensic 
sciences. Nevertheless, we have a simple collection of the focus measures. However, better results could 
have been achieved with more complex collection strategies. Due to the increasing sophistication of deepfake 
technology, it becomes necessary to keep pace with these developments and establish robust detection 
methods. Because the multimedia forensic tools worked on studying and detecting malicious manipulations 
in multimedia content before the occurrence of deep learning technology, so it was necessary to take 
advantage of these methods and combine them with deep learning techniques to build a strong detection 
model. 
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